Goto

Collaborating Authors

 sliding attribute


Fader Networks:Manipulating Images by Sliding Attributes

Neural Information Processing Systems

This paper introduces a new encoder-decoder architecture that is trained to reconstruct images by disentangling the salient information of the image and the values of attributes directly in the latent space. As a result, after training, our model can generate different realistic versions of an input image by varying the attribute values. By using continuous attribute values, we can choose how much a specific attribute is perceivable in the generated image. This property could allow for applications where users can modify an image using sliding knobs, like faders on a mixing console, to change the facial expression of a portrait, or to update the color of some objects. Compared to the state-of-the-art which mostly relies on training adversarial networks in pixel space by altering attribute values at train time, our approach results in much simpler training schemes and nicely scales to multiple attributes. We present evidence that our model can significantly change the perceived value of the attributes while preserving the naturalness of images.


Reviews: Fader Networks:Manipulating Images by Sliding Attributes

Neural Information Processing Systems

These attributes are known at training time and for a dataset of faces include aspects like [old vs young], [smiling vs not smiling], etc. They hope to be able to tweak these attributes along a continuous spectrum, even when the labels only occur as binary values. To achieve this they propose an (encoder, decoder) setup where the encoder maps the image x to a latent vector z and then the decoder produces an image taking z, together with the attributes y as inputs. When such a network is trained in the ordinary fashion, the decoder learns to ignore y because z already encodes everything that the network needs to know. To compel the decoder network to use y, the authors propose introducing a adversarial learning framework in which a discriminator D is trained to infer the attributes from z. Thus the encoder must produce representations that are invariant to the attributes y. The writing is clear and any strong researcher should be able to reproduce their results from the presentation here.


Fader Networks:Manipulating Images by Sliding Attributes

Lample, Guillaume, Zeghidour, Neil, Usunier, Nicolas, Bordes, Antoine, DENOYER, Ludovic, Ranzato, Marc', Aurelio

Neural Information Processing Systems

This paper introduces a new encoder-decoder architecture that is trained to reconstruct images by disentangling the salient information of the image and the values of attributes directly in the latent space. As a result, after training, our model can generate different realistic versions of an input image by varying the attribute values. By using continuous attribute values, we can choose how much a specific attribute is perceivable in the generated image. This property could allow for applications where users can modify an image using sliding knobs, like faders on a mixing console, to change the facial expression of a portrait, or to update the color of some objects. Compared to the state-of-the-art which mostly relies on training adversarial networks in pixel space by altering attribute values at train time, our approach results in much simpler training schemes and nicely scales to multiple attributes.